Automatic generation of domain-dependent pronunciation lexicon with data-driven rules and rule adaptation
نویسندگان
چکیده
In this paper, we describe a method for automatically generating a domain-dependent pronunciation lexicon using a data-driven approach. We also introduce an adaptation method to alleviate some of the errors caused by the data-driven rules which are derived from a relatively small volume of speech corpus. At first, pronunciation variation rules are extracted from a large volume of speech corpus and then are adapted using the rules derived from the target corpus. The context dependent pronunciation variants of the target lexicon are automatically generated by applying these rules to the training and language model adaptation text corpus. Then the pronunciation variants are pruned based on the likelihood of applied rules. Compared to the lexicon created by knowledgebased rules, on the Korean spontaneous speech corpus, our approach produces an absolute reduction of 0.8% of the WER. Furthermore, the size of pronunciation variants is reduced by almost 5.6% on the peak performance.
منابع مشابه
Joint pronunciation modelling of non-native speakers using data-driven methods
Modelling non-native speakers with different mother tongues is a difficult task for automatic speech recognition due to the large variation among speakers. One possibility for jointly modelling all speakers is to use the same speaker independent acoustic models and a joint lexicon to capture the variation. We have modified the reference lexicon using pronunciation rules that are derived in a to...
متن کاملData Driven Approaches to Phonetic Transcription with Integration of Automatic Speech Recognition and Grapheme-to-Phoneme for Spoken Buddhist Sutra
We propose a new approach for performing phonetic transcription of text that utilizes automatic speech recognition (ASR) to help traditional grapheme-to-phoneme (G2P) techniques. This approach was applied to transcribe Chinese text into Taiwanese phonetic symbols. By augmenting the text with speech and using automatic speech recognition with a sausage searching net constructed from multiple pro...
متن کاملAutomatic Learning and Optimization of Pronunciation Dictionaries
Pronunciation dictionaries are the interface between orthographic and phonetic representation of the speech signal and are thereby a substantial component of speech recognition systems. In many systems simple canonical pronunciation forms are used within the dictionary. They represent the “correct” pronunciation as they are found in lexicons and neither contain the most frequent pronunciation n...
متن کاملMultiple-Pronunciation Lexical Modeling Based on Phoneme Confusion Matrix for Dysarthric Speech Recognition
In this paper, we propose speaker-dependent multiple-pronunciation lexical modeling for improving the performance of dysarthric automatic speech recognition (ASR). For each dysarthric speaker, a phoneme confusion matrix is first constructed from the results of phoneme recognition. Then, pronunciation variation rules are extracted by investigating the phoneme confusion matrix, and they are incor...
متن کاملRule−based Categorial Analysis of Unprompted Speech − a Cross−language Study
In this study, we investigated the influence of language specifics in a cross-language task on the automatic segmentation with a self-learning algorithm for the integration of pronunciation rules. The goal of this paper is to present the linguistic and statistic results of a new method to automatically generate pronunciation rules for automatic segmentation of speech the German MAUSER system. M...
متن کامل